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Does the IMF vary? Is it significantly different in metal-rich environments than in metal-poor 
ones? Theoretical work predicts this to be the case. But in order to provide robust empirical 
evidence for this, the researcher must understand all possible biases affecting the derivation 
of the stellar mass function. Apart from the very difficult observational challenges, this turns 
out to be highly non-trivial relying on an exact understanding of how stars evolve, and how 
stellar populations in galaxies are assembled dynamically and how individual star clusters and 
associations evolve. N-body modelling is therefore an unavoidable tool in this game: the case 
can be made that without complete dynamical modelling of star clusters and associations any 
statements about the variation of the IMF with physical conditions are most probably wrong. 
The calculations that do exist demonstrate time and again that the IMF is invariant: There 
exists no statistically meaningful evidence for a variation of the IMF from metal-poor to metal- 
rich populations. This means that currently existing star-formation theory fails to describe the 
stellar outcome. Indirect evidence, based on chemical evolution calculations, however indicate 
that extreme star-bursts that assembled bulges and elliptical galaxies may have had a top-heavy 
IMF. 



1. Introduction 

The stellar initial mass function (IMF), ((m)rfm, where m is the stellar mass, is the 
parent distribution function of the masses of stars formed in one event. Here, the number 
of stars in the mass interval m, m + dm is dN = £(m) dm. Salpeter (1955) inferred the 
IMF from solar-neighbourhood star-counts applying corrections for stellar evolution and 
Galactic-disk structure finding £(m) w km~ a ,a w 2.35, for 0.4 ^m/M ^ 10. Miller 
& Scab (1979) and Scab (1986) derived the IMF for 0.1 - 60 M© using better data 
and a more sophisticated analysis establishing that the IMF flattens or turns-over at 
small masses. Modern studies of solar-neighbourhood star-count data which also apply 
detailed corrections for unknown multiple systems in the star-counts, confirm that cti = 
2.2 ± 0.3 for 0.5 <m/M Q < 1 (Kroupa, Tout & Gilmore 1993; Kroupa 1995b; Reid, Gizis 
& Hawley 2002). Using spectroscopic star-by-star observations, Massey (2003) reports 
the same slope or index a-i = 2.3 ± 0.1 for m <; 10 Mq in many OB associations and star 
clusters in the Milky Way (MW), the Large- and Small-Magellanic clouds (LMC, SMC, 
respectively). It is therefore suggested to refer to a 2 = 2.3 as the Salpeter /Massey slope 
or index. It is valid for to^0.5Mq. 

The IMF is, strictly speaking, a hypothetical construct because any observed system 
of N stars merely constitutes a particular representation of a distribution function. The 
probable existence of a unique £(m) can be inferred from observations of many ensembles 
of such TV systems (e.g. Massey 2003). If, after corrections for 

(a) stellar evolution, 

(b) unknown multiple stellar systems and 

f To appear in The Metal Rich Universe, La Palma, Canary Islands, June 12th-16th, 2006, 
G. Israelian & G. Meynet (eds.), Cambridge University Press 
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(c) stellar-dynamical biases, 
the individual distributions of stellar masses is similar within the statistical uncertainties, 
then we (the community) deduce that the hypothesis that the stellar mass distributions 
are not the same can be excluded. That is, we make the case for a universal, standard 
or canonical stellar IMF within the physical conditions probed by the relevant physical 
parameters (metallicity, density, mass) of the populations at hand. 

This canonical IMF is a two-part-power law, the only structure with confidence found 
so far being the change of index from the Salpeter/Massey value to a smaller one near 
0.5 Mq: 

ai = 1.3 ±0.5, , 0.08 < m/M Q < 0.5, , . 

a 2 = 2.3±0.5, , 0.5 < m/M Q < 150. ^ ' 

It has been fully corrected for unknown multiple stellar systems in the low-mass (m < 
1 Mq) regime, while multiplicity corrections in the high-mass regime await to be done. 
The evidence for a universal upper mass cutoff near 150 M Q (Weidner & Kroupa 2004; 
Figer 2005; Oey & Clarke 2005; Koen 2006) seems to be rather well established in popu- 
lations with metallicities ranging from the LMC (Z » 0.008) to the super-solar Galactic 
centre (Z>0.02) such that the stellar mass function (MF) simply stops at that mass. 
This mass needs to be understood theoretically (see discussion in Kroupa & Weidner 
2005). 

Chabrier (2003) offers a log-normal fornjj which fits the canonical form quite well (e.g. 
Romano et al. 2005). 

Below the hydrogen-burning limit, there is substantial evidence that the IMF flattens 
further to a « 0.3 ± 0.7 (Kroupa 2001a; Kroupa 2002; Chabrier 2003). Therefore, the 
canonical IMF most likely has a peak at 0.08 Mq. Brown dwarfs, however, comprise only 
a few % of the mass of a population and are therefore dynamically irrelevant. Note that 
the logarithmic form of the canonical IMF, ^(m) = ln(10) m £(m), which gives the 
number of stars in logioTO-intervals, also has a peak near 0.08 Mq. However, the system 
IMF (of stellar companions per binary combined to system masses) has a maximum in 
the mass range 0.4 — O.6M (Kroupa et al. 2003). 

The above form has been derived from detailed considerations of local star-counts 
thereby representing an average IMF: for low-mass stars it is a mixture of stellar popula- 
tions spanning a large range of ages (10 — Gyr) and metallicities ([Fe/H] ^ 0), while for 
the massive stars it constitutes a mixture of different metallicities ([Fe/H] ^ — 1.5) and 
star-forming conditions (OB associations to very dense star-burst clusters: R136 in the 
LMC). Therefore it can be taken to be a canonical form, and the aim is to test whether 
even more extreme star-forming conditions such as found in super-metal rich environ- 
ments or super-dense regions may deviate from it. Any systematic deviations of the IMF 
with physical conditions of the environment would constrain our understanding of star 
formation, would give us a prescription of how to set-up stellar-dynamical systems, and 
last not least would allow more precise galaxy- formation and evolution calculations. 



f Note that the log-normal form is physically better motivated in the sense that a physical 
process will not abruptly change a slope as in the canonical IMF, but it also needs to be 
extended by a power-law above 1 Mq to meet the needs of the observational data thereby 
losing its advantage. A reason why the author prefers to use the canonical form is mathematical 
simplicity and the ease with which parts of it can be changed without affecting other parts. 
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2. The expectation: the IMF must depend on star- format ion 
environment and in particular on the metallicity 

There are two basic arguments suggesting that the IMF ought to be dependent on the 
physical conditions of star formation: 

2.1. The Jeans-mass argument 

A) A region of a molecular cloud undergoing gravitational collapse will have over-dense 
sub-regions within it which are also Jeans-unstable collapsing independently to form 
smaller structures which themselves may again sub- fragment (e.g. Zinnecker 1984). Ulti- 
mately stars result. The essence of this concept is that a region spanning a Jeans length 
which has at least a Jeans mass undergoes gravitational collapse. The Jeans mass de- 
pends on the temperature and density of the cloud, Mj eans oc T 3 / 2 p -1 / 2 (e.g. Larson 
1998; Bonnell, Larson & Zinnecker 2006). Now, in metal-rich environments there is more 
dust and therefore the collapsing gas can cool more effectively reducing T and increasing 
p. Thus, 

[Fe/H] t => fragment masses j . (2.1) 

B) The fact that the IMF is not a featureless power-law but has structure in the mass 
range 0.08 ^ m/M Q ^ 0.5 suggests there to be a characteristic mass of a few 0.1 M Q . 

Bonnell et al. (2006) suggest that this characteristic mass of fragmentation may be a 
result of the coupling of gas to dust such that there is a change from a cooling equation 
of state, where T oc p~ - 25 while the density increases, to one with slight heating at high 
densities, T oc p 01 . Again, this implies a dependency of the characteristic mass on the 
metallicity through the cooling rate: 

[Fc/H] | fragment masses I . (2.2) 

There seems to be observational evidence supporting the notion that stellar masses 
are derived from Jeans-unstable mass fragments: pre-stellar cloud cores are found to be 
distributed like the canonical IMF in low-mass star formating regions (Motte, Andre & 
Neri 1998; Testi & Sargent 1998; Motte et al. 2001; but see Nutter & Ward-Thompson 
2006). 

While the concept of the Jeans mass is very natural and allows one to nicely visualise 
the physical process of fragmentation, it has the problem that the densest regions of a 
pre-star-cluster cloud core ought to have the smaller fragment masses, but instead the 
most massive stars are seen to form in the densest regions. 

2.2. The self-limitation argument 

A rather convincing physical model of the IMF which avoids the problem with the Jeans 
mass argument has been suggested by Adams & Fatuzzo (1996) and Adams & Laughlin 
(1996). The argument here is that the Jeans mass has virtually nothing to do with the 
final mass of a star because structure in a molecular cloud exists on all scales. Therefore, 
no characteristic density can be identified, and "no single Jeans mass exists". When a 
cloud region becomes unstable a hydrostatic core forms after the initial free-collapse. 
This core then accretes at a rate dictated by the physical conditions in the cloud. The 
metallicity influences the accretion rate through the sound velocity (higher sound velocity, 
larger accretion rate) by steering the cooling rate (more metals more dust => lower 
T => smaller sound velocity). The many physical variables describing the formation of a 
single star have distributions, and folding these together yields finally an IMF in broad 
agreement with a log-normal shape (as also shown by Zinnecker 1984). This fails though 
at large masses, where the IMF is a power-law and where additional physical processes 
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probably play a role (coagulation, competitive accretion). The important point however 
is that this theory also expects a variation of the resulting characteristic mass with 
metallicity as above: 

[Fe/H] t => characteristic mass J. . (2-3) 

2.3. Robust implication? 

Thus, both theoretical lines of argument seem to suggest the same qualitative behaviour, 
namely that the IMF ought to shift to smaller characteristic masses with increasing metal- 
licity. This would therefore seem to suggest a very robust if not fundamental expectation 
of star-formation theory. 

Is it born out by empirical evidence? The best way to test this theoretical expectation 
is to measure the IMF in metal-poor environments and to compare to the shape seen in 
metal-rich environments. 

A measurement of the IMF in a population with super-solar abundance would be 
especially important. Unfortunately this is extremely difficult, because populations with 
super-solar abundances are exceedingly rare. Only one star-cluster with [Fe/H]« +0.04 
is known, NGC 6791, but its ancient age of « 10 Gyr implies it to be dynamically very 
old. Evaporation of low-mass stars will thus have significantly affected the shape of the 
present-day mass function, which has not been measured yet (King et al. 2005). Other 
metal-rich environments constitute the central MW region (§ 15. 2p and galactic spheroids 
(§ 15. 3p . the latter allowing only indirect evidence on the IMF through their chemical 
properties. 

In general, measurements of the IMF are hard, because stellar masses can only be 
inferred indirectly through their luminosity, I, which also depends on age, metallicity 
and the star's spin angular momentum vector, collectively producing a distribution of I 
for a given m. The empirical knowledge that can be gained about the IMF is discussed 
next. 



3. The shape of the IMF 

3.1. The local stellar sample 

The best knowledge about the IMF we glean from the local volume-limited stellar sample. 
The basic technique is to construct the stellar luminosity function (LF), f (Mp), where 
Mp is the stellar absolute magnitude in some photometric pass band (we are stuck 
with magnitudes rather than working with the physically more intuitive luminosities due 
to our historical inheritance). The number of stars within the complete volume in the 
magnitude interval Mp, Mp + dMp is then dN = $>(Mp) dMp. These are the same stars 
as enter dN = £(m) dm from above, and thus results our master equation 

dm , „ . 
*(Mp) = -j^ £(m). (3.1) 

The observable is $ which we get from the sky. Our target is £, and the hurdle is 
the derivative of the stellar mass-luminosity relation, dm/ dMp. This is quite prob- 
lematical, because we can only get at dm/ dMp by either constructing observational 
mass-luminosity relations using extremely well-observed binary stars with known Ke- 
pler solutions which nevertheless have uncertainties that are magnified when considering 
the derivative, or we can resort to theoretical stellar models which give us well defined 
derivatives but depend on theoretically difficult processes within stellar interiors (opac- 
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ities, convection, rotation, magnetic fields, equation of state, nuclear energy generation 
processes). 

There are two basic local LF's: (I) We can count all stars within a trigonometrically- 
dcfined distance limit such that the stellar sample is complete, i.e. we can see all stars of 
magnitude Mp,Mp + dMP within a distance r t . The volume-limited sample for solar- 
type stars having excellent parallax measurements extends to r t s=s 20 — 30 pc, while for 
the faintest M-dwarfs rt ~ 5 — 8 pc (Reid, Gizis & Hawley 2002). Tests of completeness 
are made by comparing the stars with Mp within rt to the number of these stars in a 
volume element further out (Henry et al. 2002), finding that faint stars remain to be 
discovered even within a distance of 5 pc. For this nearby LF, ^ncraby, the stars are well 
scrutinised on an individual-object basis, and geometric distances are known to within 
about 10 per cent. At the faint end \P n carby is badly constrained resting on only a few 
stars. 

(II) Prompted by the "discovery" of large amounts of dark matter in the MW disk 
(Bahcall 19840, novel deep surveys were pioneered by Reid & Gilmore (1982). This 
second-type of sampling can be obtained by performing deep, pencil-beam photographic 
or CCD imaging surveys through the Galactic disk. From the 10 5 images the typically 100 
or so main sequence stars need to be gleaned using automatic image-, colour and bright- 
ness recognition systems. The distances of the stars are determined using the method of 
photometric parallax, which relies on estimating the absolute luminosity of a star from 
its colour, and then calculating its distance from the distance modulus. The resulting 
flux-limited sample of stars has photometric-distance limits within which the counts are 
complete. The distance limits decrease for fainter stars. 

Clearly, while only one nearby Li 7 " exists, many photometric LFs can be constructed for 
different fields of view. Each observation yields a few dozen to hundreds of stars, and so 
the overall sample size becomes very significant. The various surveys have shown ^Pphot 
to be invariant with direction. This should, of course, be the case, since the Galactic- 
field stars with an average age of about 5 Gyr have a velocity dispersion of about 25- 
50 pc/Myr such that within 200 Myr a volume with a dimension of the survey volumes 
(a few hundred pc) is completely mixed. 

It therefore came as a surprise that "J near and ^Pphot are significantly different at faint 
luminosities (Fig. [1]). 

3.2. The mass-luminosity relation 

When not understanding something the best strategy to continue is sometimes to simply 
"forget" the problem and continue the path of least resistance. Thus, while the prob- 
lem \tnear 7^ ^photj^fy > 12 could not be explained immediately, it turned out to be 
constructive to first ascertain which LF shape must be the correct one using entirely 
different arguments. 

In eg. 13. li the slope of the stellar mass-luminosity relation of stars enters posing a clue. 
Fig. 2 in Kroupa (2002) shows the mass-luminosity data of binary stars with Kepler 
orbits, and demonstrates that a non-linearity exists near 0.33 M Q such that a pronounced 
peak in —dmjdMy appears at My « 11.5 with an amplitude and width essentially 
identical to the maximum seen in the photometric LF at this luminosity (Fig. [T]). This 
agreement of 

(a) the location of the maximum and 

(b) the amplitude and 

f The evidence for dark matter within the solar vicinity disappeared on closer scrutiny (Kui- 
jken & Gilmore 1991). 
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Figure 1 . Stellar luminosity functions (LFs) for solar- neighbourhood stars. The photometric LF 
corrected for Malmquist bias and at the mid-plane of the Galactic disk ($ p hot) is compared with 
the nearby LF (f ncar ). The average, ground-based ^phot (dashed histogram, data pre-dating 
1995, Kroupa 1995a) is confirmed by Hubble-Space- Telescope (HST) star-count data which pass 
through the entire Galactic disk and are thus less prone to Malmquist bias (solid dots, Zheng et 
al. 2001). The ground-based volume-limited trigonometric-parallax sample (dotted histogram) 
systematically overestimates *& n car due to the Lutz-Kelker bias, thus lying above the improved 
estimate provided by the Hipparcos-satellite data (solid histogram, JahreiB & Wielen 1997; 
Kroupa 2001b). The depression/plateau near Mv = 7 is the Wielen dip. The maximum near 
My ~ 12, Mi ~ 9 is the KTG peak. The thin dotted histogram at the faint end indicates the 
level of refinement provided by recent stellar additions (Kroupa 2001b) demonstrating that even 
the immediate neighbourhood within 5.2 pc of the Sun probably remains incomplete at the 
faintest stellar luminosities. 



(c) the width of the extremum 
convincingly suggest stellar astrophysics to be the origin of the peak in the LF, rather 
than the MF. The Wielen dip (Fig. Q]) similarly results from subtle structure in the 
mass-luminosity relation. Thus, simply by counting stars on the sky we are able to direct 
our gaze within their interiors: It is the internal constitution of stars which changes with 
changing main-sequence mass and this is what drives the structure in the mass-luminosity 
relation. 

Having thus established that the peak in the LF must, in fact be there where it is also 
found as a result of fundamental astrophysical processes, we can test this result using star- 
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Figure 2. J-band LFs of stellar systems in four star clusters: globular cluster (GC) M15 (de 
Marchi & Paresce 1995a, distance modulus Am = m-M = 15.25 mag); GC NGC 6397 (Paresce 
et al. 1995, Am = 12.2); young Galactic cluster Pleiades (Hambly et al. 1991, Am = 5.48); GC 
47 Tue (de Marchi et al. 1995b, Am = 13.35). 



clusters which constitute single-age, single-metallicity and equal-distance stellar samples. 
Fig. [2] does exactly this, and indeed, a very pronounced peak is evident at exactly the 
right location and with the right width and height (Kroupa 2002: fig. 1). We can therefore 
trust the peak in ^phot- In Fig. Q]it can be seen that ^noar also shows evidence for this 
peak, noting that the peak is smeared apart in the local stellar sample because the stars 
have a wide spread in metallicities. The metallicity-dependence of the peak has been 
shown to be in agreement with the LFs of star clusters over a large range of metallicity 
(von Hippel et al. 1996; Kroupa & Tout 1997). 

Kroupa, Tout & Gilmore (1993) then performed a trick to get at the correct mass- 
luminosity relation without having to resort to theoretical or purely empirical relations 
which are very uncertain in their derivatives (Kroupa 2002: fig. 2): The Malmquist- 
corrected 'Fphot is used to define the amplitude and width and location of the extremum 
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in the single-age, single-metallicity average dm/dMy, and integrating the resulting con- 
straint leads to a semi-empirical m(Mv) relation. It is a semi-empirical relation because 
we used theoretical stellar models to place, one can say, zeroth order constraints on this 
relation, i.e. to prove the existence of the extremum and to estimate its location and 
width and amplitude. With this theoretical knowledge in-hand we then used the LF to 
constrain the detailed run of m(My). The result is in amasing agreement even with the 
most recent high-quality binary-star constraints by Delfosse et al. (2000). With the so- 
obtained m(My) relation, which has the correct derivative, it is now possible to take 
another step towards constraining the MF. 

But first the problem ^noar 7^ ^phot; Mv > 12 needs to be addressed. 

3.3. Unresolved binary systems 

One bias clearly affecting ^phot are unresolved multiple systems; in constructing V&near 
all stellar companions have been counted individually, while ^phot consists of "stars" too 
far away to be resolved if they are multiple. Also, a faint companion star may not be 
seen because it lies below the flux limit while the primary enters the formal photometric 
distance limit. Furthermore, an unresolved binary system appears redder unless both 
stars are of equal mass and this affects the calculated stellar space density because the 
photometric distance is misjudged. All the effects can and must be corrected for; this 
having been done thoroughly for the first time by Kroupa et al. (1993). 

The bias due to unresolved binaries affecting "Jphot can be nicely demonstrated using 
realistic numbers for the multiplicity in the Galactic field (Goodwin et al. 2006): Suppose 
the observer sees 100 systems on the sky. Of these 40 are binaries, 15 triples and 5 
quadruples. The multiplicity fraction is / = (40 + 15 + 5)/100 = 0.6. The observer will 
construct the LF using these 100 systems, but in fact 85 stars are missed. All of these 
are much fainter than their primary, implying a non-linear depression at the faint end of 
the LF. 

3.4. The standard Galactic- field IMF 

Together with a thorough modelling of the star-formation history of the solar neigh- 
bourhood (low- mass stars take long to descend to the main sequence), local Galactic 
disk structure and metallicity spreads, as well as different fractions of multiple stars and 
photometric and trigonometric distance uncertainties, Kroupa et al. (1993) performed 
a multi-dimensional optimisation for the parameters of a two-part-power law MF. The 
result is one MF which unifies both 'i'near and ^phot- It is given by eq. 11.11 for m ^ 1 Mq 
(Kroupa et al. 1993; Reid et al. 2002). Adopting this IMF as being correct, Kroupa 
(1995b) also showed that the observed LFs are arrived at if stars are born as binaries in 
clusters which evolve and disperse their stellar content into the Galactic field. Chabrier 
(2003) further points out that non-linearities in the colour-magnitude relation used in 
photometric parallax adds to the underestimate of the stellar densities at the faint end 

Of * p hot- 

Scalo (1986) had performed a very detailed analysis of the stellar mass function, which 
was superseded for m 1 Mq through new star-count data and the new modelling de- 
scribed above. But for m ^ 1 Mq his analysis remains valid. Using star counts out to kpc 
distances for early-type stars he modelled their spatial distribution and took account of 
their production through different star-formation rates, and arrived at an estimate of the 
Galactic-field IMF adopted in Kroupa et al. (1993): 

a 3 = 2.7±0.7, , l<m/M Q . (3.2) 

Together with eg. Il.ll th.is is the standard Galactic- field IMF. Elmegreen & Scalo (2006) 
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then showed that there can be artificial features in the massive-part of the Galactic-field 
IMF when it is deduced from the present-day field MF under varying star- formation rates 
(SFRs), frustrating attempts to attribute any possible structure there to star- formation 
physics. 

3.5. The canonical IMF 

As already stated in § [TJ the IMF for early-type stars can also be inferred for individual 
OB associations and star clusters. Massey (2003 and previous papers) has shown the IMF 
for massive stars to be independent of density (p ce ntral ~ 10 59 stars/pc 3 , the densest 
cluster being R136 in the LMC) and metallicity (Zf^ 0.002, the population with the 
lowest metal abundance being in the SMC). It always has a = 2.3 ± 0.1, thus leading to 
the standard or canonical stellar IMF, eq. 11.11 

The question now emerges as to why as ca io = 2.7 > aMasscy = 2.3 for m ^ 1 MQ. This 
is referred to as the "Scaslo vs Massey discrepancy" . 

Considering the uncertainties, it would be valid to discard the difference. However, 
using the standard Galactic-field IMF in galactic evolution models would yield about 3 
times fewer stars in the mass range 10 — 150 Mq and about 8 times fewer stars with 
150 Mq than when using the canonical IMF. So it is important to know 03. We will 
return to this later-on, but as before, when not immediately seeing a possible solution 
it often proves useful to ignore the problem until new insights open new avenues for 
exploration. 

3.6. Massive stars come seldomly alone 

Spectroscopic and speckle-interferometric observations of nearby massive star-forming 
regions have shown that OB stars prefer more than one partner (Zinnecker 2003; Goodwin 
et al. 2006). In particular, Preibisch et al. (1999) present the most detailed multiplicity 
study of the Orion Nebula Cluster (ONC) demonstrating that the rule are triples rather 
than binaries as amongst T Tauri stars. 

This allows us to set-up the following hypothesis: 

JMassey < ^ because /scalo < 1 while /Massey ~ 1- (3.3) 
aScalo 

That is, Massey was observing dynamically less-evolved and therefore more binary-rich 
populations whereas Scalo saw the field population which is, to a significant extend, 
"contaminated" by runaway OB stars which are mostly single (Stone 1991). This would 
be a neat solution to our problem: fainter OB stars are hidden among their brighter 
primaries flattening the Massey-IMF, while the mostly single OB-field stars in the Scalo 
sample would be a more correct representation of the true IMF. 

Thus we may be led to conclude that in actuality the true (binary-corrected) IMF is 
cq. l3.2l fa^ « 2.7), while eci. ll.ll faq w 2.3) is wrong. This would, of course, have important 
implications for Galactic astrophysics as well as cosmology; most galaxy-formation and 
evolution calculations are being done with a Salpeter IMF. 

4. Lessons from star clusters 

Star clusters pose star-formation events which are extremely well correlated in space, 
time and chemistry, and so studying the MF within clusters avoids most of the problems 
we need to deal with in the Galactic field. However, there are a number of serious issues 
that essentially annihilate any advantage one might have gained by having an equal-age, 
-distance and -metallicity population: 
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4.1. Young clusters 

To avoid dynamical evolution complicating the star-counts in a cluster, young clusters are 
chosen. However, uncertainties in pre-main sequence tracks make the calculation of stellar 
masses from stellar luminosities and colours or spectra very uncertain for ages less than 
a few Myr. Errors of about 50 % would not be untypical. These errors are not randomly 
distributed though, but cause systematic offsets from the true stars. And so unphysical 
structures appear in the MF. This is discussed in more depth in Kroupa (2002). Very 
young clusters also have a high binary fraction, which again would require corresponding 
corrections. Young clusters evolve very rapidly during the first 1 Myr as a result of 
residual gas expulsion. For example, the ONC with an age of about 1 Myr is already 
between 5 and 15 initial crossing times old, implying a rapid dynamical evolution of the 
binary population (fig. 1 in Kroupa 2000; fig. 9 in Kroupa et al. 2001) and an expansion 
of the cluster such that a substantial fraction of its initial population has probably been 
lost (Kroupa, Aarseth & Hurley 2001). This translates into a selective stellar-mass loss if 
the cluster was initially significantly mass-segregated (Moraux, Kroupa & Bouvier 2004) . 
It is therefore never clear exactly which corrections have to be applied, and an inferred 
stellar MF is very likely not to be a good representation of the IMF. 

4.2. Old clusters 

To avoid the issues above (uncertain pre-main sequence theory, rapidly evolving stellar 
and binary populations), older clusters within which stars are on or close to the main 
sequence would appear to be useful. They are definitely useful, but such clusters are 
heavily evolved dynamically, and so the MF will not represent the IMF even for clusters 
with an age near 100 Myr. Unresolved multiple systems remain a problem, and for ex- 
ample for the Pleiades Stauffer (1984) shows that the fraction of photometric binaries is 
26 % (these are stars that are more luminous than a single star of the same colour) , while 
Kaehler (1999) notes that a true binary fraction between 60 and 70 % may be possible. 
Even globular clusters appear to have a sizeable binary population (e.g. Rubcnstcin & 
Bailyn 1997). 

A star cluster which has managed to survive its initial gas expulsion phase re-virialises 
on a time-scale of tens of Myr (Kroupa et al. 2001). It's binary population is depleted by 
then such that binaries with orbital Kepler velocities smaller than the velocity disper- 
sion in the pre-expansion cluster will have been disrupted. The remaining binaries are 
mostly inert; sometimes energetic binary-single-star or binary-binary encounters near 
the cluster core eject 1-3 stars (e.g. a single star and the binary) from the cluster. The 
cluster's evolution is, however, well described by classical dynamical evolution tracks, 
i.e. the surviving binaries are dynamically not important, as shown explicitly by Kroupa 
(1995c). During this long-lived phase the cluster evolves towards energy equipartition, 
which implies that the low-mass stars gain energy and thus move outwards becoming lost 
to the Galactic tide, while the more massive stars loose energy and segregate towards 
the center of the potential. This process never stops, i.e. a cluster never actually reaches 
energy equipartition and dynamical equilibrium. The consequence is that as the cluster 
ages, it's MF becomes increasingly depleted in low-mass stars. This is nicely evident in 
low-mass clusters in terms of a flattening of the LF, while unresolved binaries remain a 
significant bias despite their dynamical unimportance (fig. 10 in Kroupa 1995c). Baum- 
gardt & Makino (2003) study the evolution of the MF in much detail for massive star 
clusters without binaries. In their fig. 7 they nicely show how the MF changes its slope 
in dependence of the cluster's dynamical age expressed in terms of the fraction of the 
time until cluster disruption: clusters that have passed 50 % of their disruption time have 
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essentially a flat (a « 0) MF for 0.1 m/M© 0.8, while for older clusters a becomes 
negative. 

4.3. Massive stars in clusters 

Two processes compete for all stellar masses, but are particularly pronounced for massive 
stars because of the involved energetics. 

Mass segregation: If massive stars form throughout a forming star cluster energy 
equipartition will force them to segregate to the center within a timescale t msgr ~ 
2 {jYl&v /^massive )*relax, where ?n a v , TO mass i ve are the masses of the average star and the 
massive stars, and i re iax is the median two-body relaxation time, as detailed in Kroupa 
(2004). For example, for the ONC, i re iax ~ 0.6 Myr and m av /m ma ssivc ~ 10 -2 such 
that tmsgr ~ 0.1 Myr <C age of the ONC. No wonder that the ONC sports a beautiful 
Trapezium, although it may also have formed at the centre (Bonnell & Davies 1998). 

Core decay: The massive stars at the cluster centre, which may have been born 
there or segregated there, form a short-lived small- N m core. It decays on a time-scale 
^dccay ~ N m x i C ross, core , where N m is the number of massive stars in the core and Across, core 
is its crossing time. Again, for the ONC, we find (Kroupa 2004) idecay ~ 10 4 — 10 5 yr <C 
its age. So why does the Trapezium still exist? 

4.4. A highly abnormal "IMF" in the Orion Nebula Cluster 

Detailed TV-body computations confirm this time-scale problem (Pflamm-Altenburg & 
Kroupa 2006): The ONC also has a highly abnormal "IMF" - only 10 stars more massive 
than 5 Mq are found in it, while 40 would be needed to allow the very final remnant of the 
central core to be still visible today and to account for the mass of the most massive star, 
given the cluster-mass-maximal-mass relation observed to exist among young clusters 
(Weidner & Kroupa 2006). 

This appears to be the rule rather than the exception, Pflamm-Altenburg & Kroupa 
(2006) discuss the older Upper Scorpius OB association which also shows a deficit of 
massive stars compared to the canonical IMF. Naively this may be taken as good evidence 
for a bottom-heavy present-day IMF in a relatively metal-rich environment, which would 
even be consistent qualitatively with the theoretical arguments of § [2] (higher- metallicity 
environments making lower-mass stars on average). 

Further work, however, shows this to be an illusion: Pflamm-Altenburg & Kroupa 
(2006) apply a specially designed chain-regularisation code (CATENA) to study the 
dynamical stability of ONC-type cores, finding that if the ONC and Upper Scorpius 
OB association, which is understood to be an evolved version of the ONC, were born 
with 40 OB stars, then the observations are well-accounted for. Indeed, Stone (1991), 
among others, shows that 46 % of all O stars are probably runaways of which about 
18 % have line-of-sight velocities ^30 pc/Myr. Energetic dynamical encounters in clus- 
ter cores therefore have a very significant effect on the shape of the "IMF" in clusters 
and OB associations. And, massive stars are rapidly dispersed from their birth sites with 
corresponding implications for the spreading of heavy elements and feedback-energy de- 
position throughout a galaxy. 

The implications of these studies is therefore that the massive-star content of star 
clusters and OB associations is not a measure of the true initial content, unless the full 
dynamical history of the cluster or OB association is taken into account. 

4.5. Summary 

It would thus appear that clusters and OB associations of any age are a horrible place 
to study the IMF. 
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This does not mean, however, that these systems are useless. Quite on the contrary: 
they are indeed our only samples of stars formed together and are therefore the only 
samplings from the IMF that exist before dispersion into the field. The point of the above 
pessimistic note is to instill the fact that any measurement of the "IMF" in a cluster or 
association must be accompanied by a dynamical investigation before conclusions about 
the possible variation of the IMF can be made. An excellent example follows in § 15.21 

5. Evidence for variation of the IMF 

The derivation of MFs in many clusters and associations leads to many values of a 
measured over different stellar mass ranges. The alpha-plot, a(m), can therefore give us 
information about the true shape of the putative underlying IMF, and also of the scatter 
about a mean IMF (Scalo 1998). This scatter can then be studied theoretically using 
the methods mentioned above, i.e. TV-body calculations of star-clusters with different 
properties. Thus, § 14.51 needs to be invoked in the context of some recent interesting 
observational studies that suggest the IMF to vary at low masses (e.g. Oasa et al. 2006) 
and at high masses (Gouliermis et al. 2005). 

One important aspect here is the purely statistical scatter resulting from finite-iV 
sampling from a putative universal IMF (§[T]). Elmegreen (1997, 1999) showed that the 
observed scatter is consistent with the variations of a(m) as a result of sampling from the 
IMF. Stellar-dynamical and unresolved binary-star biases lead to systematic deviations 
of a(m) away from the true value, and Kroupa (2001, 2002) has shown the observed 
alpha-values to be consistent with the canonical IMF (eq. II. ip for a large variety of 
stellar populations using theoretical models of initially binary-rich star clusters. 

As already deduced with some force by Massey (2003, and previous papers), there is no 
evidence for systematic shifts of a(m) with metallicity nor density. At the one extreme 
end of the scale of physical conditions, globular clusters have very similar MFs as young 
Galactic clusters and the Galactic field, while constraints on the massive-star content of 
globular clusters are such that the IMF could not have been too top heavy as otherwise 
the mass-loss from evolving stars would have unbound the clusters (Kroupa 2001). At 
the other extreme end, the super-solar Arches cluster near the Galactic centre has a 
massive-star IMF again very similar to the Salpeter/Massey one 15.21 below). 

Kroupa (2002) studied the distribution, / Q (a), of measured a values for stars more 
massive than a few M© finding it to be describable by two Gaussians, one somewhat broad 
but with symmetric wings, and the other exceedingly narrow, and both centered on the 
Salpeter/Massey value a = 2.35. This is very surprising because the various et(m) values 
were obtained by different research groups observing different objects. An equivalent 
theoretical data set shows a somewhat broader unsymmetric distribution offset from the 
input Salpeter/Massey value. This needs further investigation, because one would expect 
the theoretical data to be better "behaved" than the observational ones which include 
the entire complications of inferring masses from observed luminosities. Concerning this 
open question, Mafz Apellaniz & Ubeda (2005) point out that binning the star-counts 
into mass-bins of equal mass leads to biases that distort the shape of the IMF when the 
number of stars is small, which is often the case at high masses. 

Returning to the empirically-determined existence of an upper mass limit to stars 
near 150 Mq (§ [U, it remains to be understood why it does not seem to depend on 
metallicity: Theoretical concepts would suggest that feedback ought to play a role in 
limiting the masses of stars, but this implicitly contains a metallicity dependence as the 
coupling of radiation to the accreting gas is more efficient in metal- rich environments. 
Thus, we are left yet again with two open questions. 
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5.1. Back to the "Scalo vs Massey discrepancy" 

One important implication from this analysis of a(m) values for massive stars is that 
the above hypothesis (eq. 13 . 3|) must be discarded: if unresolved multiple systems were 
the origin of the Scalo vs Massey discrepancy, then f a (a) would have an asymmetric 
distribution stretching from the single-star value (a = 2.7) to the unresolved binary- 
star value (a = 2.3). This is not the case. The Scalo vs. Massey discrepancy therefore 
is either non-existing (perhaps because Scalo's analysis requires an update), or it has 
another hitherto unknown origin. The proposition by Elmegreen & Scalo (2006) that the 
IMF calculated from the present-day field MF may be steeper than the canonical IMF if 
the field was created during a lull in the star-formation rate deserves some attention in 
this context. 

5.2. An example: The Galactic central region 

Reports do exist that in extreme environments there is evidence for a top-heavy IMF. 
These often come from indirect arguments based on luminosity and available mass for 
star formation in star-bursts, for example: scaling the canonical IMF to the observed 
luminosity would lead to a mass in stars far superior to dynamical mass constraints from 
kinematics, requiring either a top-heavy (smaller a^) IMF, or an IMF truncated typically 
near or above 1 M©. Paumard et al. (2006) report evidence for a top-heavy IMF in both 
circum- nuclear disks in the MW. While the observations are quite convincing, § 14.51 needs 
to be invoked before taking such results as affirmative evidence for a variable stellar IMF. 

A case in point is the Arches cluster at the Galactic centre, where the conditions for 
star formation are very different to those prevalent in the solar vicinity or the disk of the 
LMC, by being in general warmer (^30 K instead of 10 K) and having a super-solar 
metallicity (-0.15 < [Fe/H] < +0.34). 

Klessen, Spaans, Jappsen (2006) perform an excellent state-of-the art calculations of 
star-formation under such conditions. They state in their abstract: "In the solar neigh- 
bourhood, the mass distribution of stars follows a seemingly universal pattern. In the 
centre of the Milky Way, however, there are indications for strong deviations and the 
same may be true for the nuclei of distant star-burst galaxies. Here we present the 
first numerical hydro-dynamical calculations of stars formed in a molecular region with 
chemical and thermodynamic properties similar to those of warm and dusty circum- 
nuclear star-burst regions. The resulting initial mass function is top-heavy with a peak 
at « 15 Mo, a sharp turn-down below w 7 Mq and a power-law decline at high masses. 
We find a natural explanation for our results in terms of the temperature dependence of 
the Jeans mass, with collapse occurring at a temperature of « 100 K and an H2 den- 
sity of a few 10 5 cm -3 , Theirs would be in agreement with the previously reported 
apparent decline of the stellar MF in the Arches cluster below about 6M©. 

Shortly after the above theoretical study appeared, Kim et al. (2006) published their 
observations of the Arches cluster and performed the necessary state-of-the art TV-body 
calculations of the dynamical evolution of this young cluster, revising our knowledge sig- 
nificantly. Quoting from their abstract: "We find that the previously reported turnover 
at 6M© is simply due to a local bump in the mass function (MF), and that the MF 
continues to increase down to our 50 % completeness limit (1.3 Mq) with a power-law 
exponent of T = —0.91 for the mass range of 1.3 < to/M© < 50. Our numerical calcu- 
lations for the evolution of the Arches cluster show that the T values for our annulus 
increase by 0.1-0.2 during the lifetime of the cluster, and thus suggest that the Arches 

cluster initially had a Gamma of —1.0 1.1, which is only slightly shallower than the 

Salpeter value." (r = 1 — a). 

This serves to illustrate the general observation that when detailed high-resolution 
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observations and accurate iV-body calculations are combined, claims of abnormal IMFs 
tend to vanish, irrespective of what theory prefers to have. Another example, the ONC, 
has already been noted above. 

5.3. Spheroids: old, very extreme star-bursts? 

An interesting, albeit indirect, result by Francesca Matteucci and her collaborators based 
on multi-zone photo-chemical evolution studies of the MW, its Bulge and elliptical galax- 
ies shows the MW disk to be re-producible by the standard Galactic-field IMF (eq. 13.21 
as « 2.7) but not the canonical IMF (eq. ll.ip . This is supported by the independent work 
of Portinari, Sommer-Larsen & Tantalo (2004). In contrast, the MW Bulge may require 
a rapid phase of formation lasting 0.01-0.1 Gyr and a top-heavy IMF with a w 1.95 
(Romano ct al. 2005; Ballero, Matteucci & Origlia 2006). For elliptical galaxies a similar 
scenario seems to hold, whereby the IMF needed to account for the colours and chemical 
properties is the canonical one (a^ ~ 2.3), but flattening (decreasing a^) slightly with 
increasing galaxy mass (Pipino & Matteucci 2004). An experiment, where the yields for 
massive stars are changed as much as possible without violating available constraints 
confirms this result - it seems not to be possible to understand the metallicity distribu- 
tion of MW Bulge stars with a standard Galactic field IMF. Instead, 0:3 ss 1.9 seems 
to be required (Ballero, Kroupa & Matteucci 2007). Nevertheless, uncertainties remain. 
Thus, Samland, Hcnsler & Theis (1997) find the Bulge to have formed over a time-span 
of about 4 Gyr with a Salpeter IMF using a two-dimensional chemo-dynamical code, 
while Immeli, Samland & Gerhard (2004) obtain formation time-scales near 1 Gyr using 
a three-dimensional chemo-dynamical code. Finally, Zoccali et al. (2006) find the "MW 
bulge to be similar to early-type galaxies, in being a-element enhanced, dominated by old 
stellar populations, and having formed on a timescale shorter than «1 Gyr" . "Therefore, 
like early-type galaxies the MW bulge is likely to have formed through a short series of 
starbursts triggered bythe coalescence of gas-rich mergers, when the universe was only a 
few Gyr old" . For low-mass stars, Zoccali et al. (2000) find the Bulge to have a similar 
MF as the disk and globular clusters. 

5.4. Composite populations 

The "Scalo vs Massey discrepancy" found in § 13.51 and its proposed solution in § 13.61 in 
terms of unresolved multiple systems was found, in § I5.1[ not to be possible. Again, as in 
§ 13.51 we might argue that trying to solve the discrepancy is not worth the effort, given 
the uncertainties. Indeed, the probable solution drops-out through an entirely different 
line-of-thought: 

It needs to be remembered that star clusters are the fundamental building blocks of 
galaxies (Kroupa 2005). Therefore, to be exact, all the stellar IMFs in all the clusters 
forming in one "star-formation epoch" (Weidner, Kroupa & Larsen 2004) of a galaxy 
must be added to calculate the global, galaxy- wide "integrated galactic IMF" (IGIMF). 
This has been pointed-out by Vanbeveren (1982, 1984) and discussed again by Scalo 
(1986), and formulated for an entire galaxy by Kroupa & Weidner (2003, 2005), noting 
that the star-clusters are distributed as a power-law embedded-cluster mass function 
with an exponent (3 w 2 ±0.4, where the Salpeter value would be 2.35. Also, an empirical 
relation exists limiting the mass of the most massive star in a cluster with stellar mass 
M ec i: m < m max (M ec i) (Weidner & Kroupa 2006), enhancing the IGIMF^IMF effect. 

Putting this together, an integral over all clusters formed together in one epoch results, 
and the composite IMF turns out to be steeper with o^igimf > «3 (= the canonical IMF 
value) for m^l.6M with a steep downturn at a mass m max . ga i. Both, o^igimf and 
the galaxy- wide maximum star mass, m max . ga i, depend on the star-formation rate of the 
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galaxy, and so the situation becomes complex with important implications for galactic 
spectro-photometric and chemical evolution studies given that the number of supernovae 
of type II is depressed significantly per unit stellar mass formed when compared to a 
universal Salpeter IMF, which is often assumed in cosmological applications (Goodwin 
& Pagel 2004; Weidner & Kroupa 2005). Koeppen, Weidner & Kroupa (2006) show that 
this IGIMF-ansatz naturally explains the observed form of the mass-metallicity relation 
of galaxies without the need to invoke selective outflows (which are not excluded though) . 

Applied to the MW, Scalo's w 2.7 comes-out naturally for an input canonical IMF. 
This is also required by spectro-photometric and chemical-evolution models of the MW 
disk (§ 15. 3[) . At the same time, for LSB galaxies the IGIMF must be steep, i.e. bottom- 
heavy. Again, this seems to be confirmed by observations (Lee et al. 2004). Is this, then, 
the solution to the "Scalo vs Massey discrepancy" ? 

Elmegreen (2006), however, opposes the notion that the IGIMF is different to the 
stellar IMFs found in individual clusters, mostly because the star-cluster mass function 
is not steep steep enough (/3 < 2), in which case IGIMF ss IMF as already shown by 
Kroupa & Weidner (2003). Much therefore rests on the detailed shape of the star-cluster 
initial mass function] Also, the existence of a physical m max (Af ec i) relation is of central 
importance to the IGIMF ^ IMF argument, but opposing views are being voiced (e.g. 
Elmegreen 2006). An argument against IGIMF ^ IMF is that some studies suggest these 
to be equal, e.g. in elliptical galaxies (§ 15. 4[) . However, an appreciable fraction of the 
research which Elmegreen fields in-support of his equality-conjecture actually support 
the inequality-conjecture. So this issue is still very much under debate and may need 
observational improvement by (i) better constraining the true initial star-cluster mass 
function and (ii) improving the empirical m max (M cc i) relation. Observational work is 
underway to test the IGIMF^IMF notion, and for example Selman & Melnick (2005) 
find the composite IMF to be flatter than the stellar IMF in the 30 Dor star-forming 
region, as predicted, but only with low-significance. 

6. Concluding remarks 

It would have seemed that when two mutually more or less exclusive theories of the 
origin of stellar masses make the same basic prediction concerning the variation of the 
average stellar mass with physical conditions, that this expected variation would be very 
robust and born out in observational data. Alas, the observational data on the IMF are 
resilient - they do not yield what we desire to see. The stellar IMF is invariant and can be 
best described by a two-part power-law form (eq. II. ip . This holds true for metallicities 
ranging from those of globular clusters to super-solar values near the Galactic centre, 
and for densities less than about 10 6 stars/pc 3 . Even the maximal stellar mass of about 
150 Mq seems to be independent of metallicity for ZfZ, 0.008. 

Given the quite total failure to account for the resilience of the stellar IMF towards 
changes, it is clear that this IMF-conservatism poses some rather severe challenges on 
liberal star formation theory. 

Only indirect arguments based on chemical evolution work seem to suggest the IMF 
to have been somewhat flatter for massive stars during the extreme physical conditions 
valid during the formation of the MW Bulge and elliptical galaxies. Variations of the 
galaxy-wide IMF (the IGIMF) among galaxies and also in time can be understood today 
despite the existence of a universal canonical IMF, but this issue is still very much in 
debate (Elmegreen 2006 vs Weidner & Kroupa 2006). The evidence noted in § 15. 31 appears 
to suggest that perhaps q^jgimf ~ 2.7 for the MW disk, o^igimf ~ 2.3 for E-galaxies, 
while «3, igimf ~ 1-9 for massive E-galaxies and the MW Bulge. Is this true? 
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